Special 2-flags in lengths not exceeding four: 
a study in strong nilpotency of distributions 
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Abstract 

In the recent years, a number of issues concerning distributions generating 1- 
flags (called also Goursat flags) has been analyzed. Presently similar questions are 
discussed as regards distributions generating multi-Rags. (In fact, only so-called 
special multi-flags, to avoid functional moduli.) In particular and foremost, special 
2-flags of small lengths are a natural ground for the search of generalizations of 
theorems established earlier for Goursat objects. In the present paper we locally 
classify, in both C w and C°° categories, special 2-flags of lengths not exceeding four. 
We use for that the known facts about special multi-flags along with fairly recent 
notions like strong nilpotency of distributions. In length four there are already 34 
orbits, the number to be confronted with only 14 singularity classes - basic invariant 
sets discovered in 2003. 

As a common denominator for different parts of the paper, there could serve the fact 
that only rarely multi-flags' germs are strongly nilpotent, whereas all of them are 
weakly nilpotent, or nilpotentizable (possessing a local nilpotent basis of sections). 

1 Definition of special /c-flags and their singularities 



Special fc-flags (the natural parameter k > 2 is sometimes called 'width') of lengths 
r > 1 can be defined in several equivalent ways, like in [KRub], [PaR], [M2]. All these 
approaches can be reduced to one transparent definition. (The reduction is via two early 
\ Bryant's results from [B], one lemma from [PaR], and the answer to a recent question of 
Zhitomirskii, cf. p. 165 in [M2].) 

Namely, for a distribution Dona manifold M, the tower of consecutive Lie squares of D 

D = D r C D r - 1 C D 1 - 2 C ■ • • C D 1 C D° = TM 

(that is, [_D J , D^] = D^ 1 for j = r, r — 1, . . . , 2, 1) should consist of distributions of ranks, 
starting from the smallest object D r : k + 1, 2k + 1, . . . , rk + 1, (r + l)k + 1 = dimM 
such that 
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• for j — 1, . . . , r — 1 the Cauchy-characteristic module L(D j ) of D j sits already in the 
smaller object D^ +1 : L{D^) C -D J+1 and is regular of corank 1 in while L(D r ) = ; 

•• the covariant subdistribution F of -D 1 (see [KRub], p. 5 for the definition extending the 
classical Cartan's approach from [C], p. 121) exists and is involutive. Note that, in view 
of Lemma 1 in [KRub], such an F is automatically of corank 1 in D 1 ; the hypotheses 
in that lemma are satisfied as rk [D 1 , D 1 ]/ D 1 = k > l. 1 

Attention. Recently new works [Ad] and [SY] have appeared, revisiting, among other 
subjects, the very definition of special multi-flags. In the light of those works, the extensive 
condition • in the definition above is redundant. This condition follows from •• and the 
property of regular dimension growth of the flag of consecutive Lie squares of the initial 
distribution D, so-called the big flag of D. Things being so, the entire theory of special 
multi-flags starts to appear more compact (the more compact the better). 

Note also that different properties of Cartan's original object discussed in [C] (treated 
nowadays in the genericness' context as a local module of vector fields) were grouped 
together in [MPel]. 

Special multi-flags, and in particular special 2-flags, appear, from the one side, to be 
rich in singularities, and from the other - to possess finite-parameter families of pseudo- 
normal forms, with no functional moduli. It is natural, then, to search for precise normal 
forms for them, at least in small lengths. Realizing well that, from certain length onwards, 
some parameters may prove genuine moduli, as we, besides, rigorously exemplify in Section 
1.5. (Our example is in length r = 7 and works, in fact, in all widths k > 2, not only for 
k — 2. It is likely that moduli of special multi-flags exist already in length six. Moreover, 
the length of the onset of moduli may decreasingly depend on flags' width.) 

In the parallel framework of 1-flags (most often called Goursat or Cartan-Goursat) 
similar questions have led to lists of exact local models in lengths not exceeding seven 
and to the discovery of real moduli in lengths from eight on. A distinctive feature of 
Goursat flags is that for them the property • comes in automatically and that there is 
plenty of involutive corank one subdistributions of D 1 (cf. ••), although none of them 
is canonical, while the covariant subdistribution of D 1 is simply L^D 1 ). The key tool 
for 1-flags, sufficient up to length six, has been the Jean stratification [J] describing in 
geometric terms (if only implicitly) the sequences of consecutive singularities showing up 
in 1-flags. 

For special 2-flags, it is not possible to follow that way too closely, although one 
natural stratification, into singularity classes, exists ([M3, M4]). It does not, however, 
correspond to Jean's one, but rather to a much coarser stratification of Goursat objects 

1 Equivalently, using Tanaka's and Yamaguchi's approach [T, Y] (well anterior to [KRub] and not 
designed for special flags, although applicable for them), one stipulates in •• two things: — the distribution 
D 1 = D 1 /L(D 1 ) of rank k + 1 on a manifold of dimension 2k + 1 is of type k) of [T] and, as such, 

possesses its symbol subdistribution F C D 1 ([Y], p. 30) and — F is involutive (cf. Prop. 1.5 in [Y]). F 
is then the counterimage of F under the factoring out by L(D 1 ). Thus, for special fc-flags, k > 2, the 
stipulated involutive corank one subdistribution of D 1 is at the same time: the covariant subdistribution 
in the Cartan-Kumpera-Rubin sense and symbol subdistribution in the Tanaka-Yamaguchi sense. 
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into Kumpera-Ruiz classes - cf. [MonZ], p. 466. Jean-like singularities in special 2-flags 
(and all the more so for k > 2) seem to be incredibly rich and escaping any reasonable 
ordering. Places resembling his approach can be detected in the present work: we often 
distinguish between transverse and tangent, but it is worlds apart from the regular ternary 
tree of 'basic geometries' of 'car + trailers' systems. 

Putting things simply, in 2-flags there is much more room for singular positions than 
in 1-flags. Already in length three the singularity classes evoked above fail to fully describe 
the orbits of the local classification; one of them splits up into three orbits. In addition, a 
fairly new notion of strong nilpotency ([Ml]) appears to be useful. It allows to completely 
describe all orbits in lengths 3 and 4, and can be perceived as a key notion of the present 
work. With its use we show - this is our main result - that there are 34 orbits of the local 
classification (in both smooth and analytic category) of special 2-flags of length four, as 
contrasted with only 14 singularity classes in that length. In this way the length four 
appears to be 'discrete' yet, with no moduli whatsoever. 

It is to be underlined at this place that the local classification of special /c-flags appears 
to be stable with respect to the width k > 2 for lengths r < 3, but already not, for various 
(not all predictable) reasons, for r = 4. Compare in this respect Remark 4 and Section 
7.7. 

1.1 Sandwich Diagram for multi-flags. 

All these requirements merge naturally into a sandwich diagram. 2 Note that the inclusions 
L(D- 7_1 ) D L(D^) in its lower line are due to the Jacobi identity. 
TM = D° D D 1 D D 2 D D 3 ■■■ D r ~ l D D r 
U U U U U 

F D L(D r ) D L(D 2 ) ■■■ L(D r ~ 2 ) D L(D r ~ l ) D L(D r ) = . 
As for the inclusion F D L(D 1 ), it follows from [KRub] and, besides, is a part of the 
answer to the question mentioned in the previous paragraph. All vertical inclusions in 
this diagram are of codimension one, while all (drawn, we do not mean superpositions of 
them) horizontal inclusions are of codimension k. The squares built by these inclusions can, 
indeed, be perceived as certain 'sandwiches'. For instance, in the utmost left sandwich F 
and D 2 are as if fillings, while D 1 and L(D 1 ) constitute the covers (of dimensions differing 
by k + 1, one has to admit). At that, the sum k + 1 of codimensions, in D 1 , of F and 
D 2 equals the dimension of the quotient space D l /L(D l ), so that it is natural to ask 
how the /c-dimensional space F/L(D l ) and the line D 2 /L(D 1 ) are mutually positioned 
in D 1 /L(D 1 ). Similar questions impose by themselves in further sandwiches 'indexed' by 
the upper right vertices D 3 , D 4 , . . . , D r . 

2 so called after a similar (if not identical) diagram assembled for Goursat distributions, or 1-flags, in 
[MonZ] 
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1.2 Analogues for special multi-flags of Kumpera-Ruiz classes. 



We first divide all existing germs of special fc-flags of length r into 2 r_1 pairwise disjoint 
sandwich classes in function of the geometry of the distinguished spaces in the sandwiches 
(at the reference point for a germ), and label those groups by words of length r over the 
alphabet {1,2} starting (on the left) with 1, having the second cipher 2 iff D 2 (p) C F(p), 
and for 3 < j < r having the j-th cipher 2 iff D^(p) C L{D^~ 2 ){p). 

This construction puts in relief possible non-transverse situations in the sandwiches. For 
instance, the second cipher is 2 iff the line D 2 (p) / L(D l )(p) is not transverse, in the space 
D 1 (p)/L(D l ), to the codimension one subspace F(p)/L(D 1 )(p), and similarly in further 
sandwiches. This resembles the Kumpera-Ruiz classes of Goursat germs constructed in 
[MonZ] . In length r the number of sandwiches has then been r — 2 (and so the # of KR 
classes 2 r ~ 2 ). For multi- flags this number is r — 1 because the covariant distribution of D 1 
comes into play and gives rise to one additional sandwich. 

How can one ascertain if such virtually created sandwich classes really materialize, 
and, if so, how to possibly sort them further? In the present paper we restrict ourselves to 
k = 2, whereas the general construction (in the framework of multi-dimensional Cartan 
prolongations) is given in [M2]. We will produce a huge variety of polynomial germs at 
G M. N , N possibly very large (odd), of rank-3 distributions. Often - this is important - 
certain variables Xj will appear in them in a shifted form b + Xj, and it will always be an 
issue if such shifting constants are rigid or flexible, subject to further simplifications. More 
precisely, for each m G {1, 2, 3} we are going to define an operation m producing new 
rank-3 distributions from previous ones. Technically, its outcome (indices of new incoming 
variables) will also depend on how many operations have been done before m. 

More specifically, the outcome of m - being performed as operation number / - on 
a distribution (Zi, Z 2 , Z 3 ) defined in the vicinity of G M s (wi, . . . , u s ), is the germ at 
G 1R s+2 (-Ui, . . . , u s , Xi+i, yi+i) of a new rank-3 distribution generated by 

{Zi + (bi+i + xi +1 )Z 2 + (q+i + yi+i)Z 3 , when m = 1, 
x l+1 Zi + Z 2 + (q+i + yi+i)Z 3 , when m = 2, 

x l+l Z 1 + y l+1 Z 2 + Z 3 , when m = 3 

and Z' 2 = Q ® , Z' 3 = q^^', b and/or c are certain constants (depending on the germ 
under consideration). For any possible next such operation (and one is bound to perform 
many of them) it is important that these local generators are written precisely in this 
order, yielding together a new 'longer' or more involved distribution (Z[, Z 2 , Z' z ). Note 
that two operations 1 and 2, out of three typically available, bring in new numerical 
parameters (adding to possibly already existing previous parameters). 

Extended K-R pseudo- normal forms (EKR for short), of length r > 1, denoted by 
ji- J2 • • • jr, where j±, . . . , j r G {1, 2, 3} and depending on numerous numerical param- 
eters within a fixed symbol ji. j 2 • • • j r , are defined inductively, starting from the empty 
label distribution 

/ d d d \ 
\<9t' dx± ' dy±J 
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understood in the vicinity of G IR 3 (t, x\, yi). Then, assuming the family of pseudo- 
normal forms ji. . . jr-i already constructed and written in coordinates that go along with 
the operations: first ji, then j 2 and so on up to j r _i, the normal forms subsumed under 
the symbol . . j r _i. j r are the outcome of the operation j r performed as the operation 
number r over those coordinately written distributions ji. . . j r -i- 

For a moment, it is nearly directly visible that every EKR is a special 2- flag of length 
equal to the number of operations used to produce it. In particular, it is easy to predict 
what are, for any EKR of length r, the involutive subdistributions of ranks 2, 4, ... , 2r; 
see Observation 1 below. The point is that locally the converse is also true, and one has 

Theorem 1 ([M2]). Let a rank-3 distribution D generate a special 2- flag of length r > 1 
on a manifold M 2r+3 . For every point p G M, D in a neighbourhood of p is equivalent, by 
a local diffeomorphism that sends p to 0, to a certain EKR j1.j2--.jr in a neighbourhood 
of G IR 2r+3 . Moreover, that EKR can be taken such that ji = 1 and the first letter 2, 
if any, appears before the letters 3. 

The restriction on the EKR codes mentioned in this theorem is called, after [M2], the 
rule of the least upward jumps: after the starting 1, and possibly several more l's, there 
must first appear a 2 and only later a 3, if any. Note also that possible constants in the 
EKR's representing a given germ D are not, in general, defined uniquely, as shows already 
Example 1. For r < 4 this, in all EKRs, is duly analyzed in the present contribution, and 
conclusions differ sometimes from natural expectations. 

Example 1. The EKR 1.1... 1 (r ciphers 1) subsumes a vast fan of different pseudo- 
normal forms - germs at G IR 2r+3 parametrized by real parameters b 2 , C2, ■ ■ ■ , br+i, Cr+i- 
Under a closer inspection (Theorem 1 in [KRub]), they all are pairwise equivalent, and 
are equivalent to the classical Cartan distribution (or jet bundle in the terminology of [Y]) 
on the space J r (l, 2) of the r-jets of functions K. — > R 2 , given by the Pfaffian equations 

dxj — Xj + idt = = dyj — yj+idt , j — 1, 2, . . . , r . 

All other EKRs are not equivalent to the jet bundles, as is explained in Proposition 1 
below. It is to be noted that the question of a geometric characterization of Cartan distri- 
butions was addressed in many works and, in full generality (for all jet spaces J r (m, k)), 
was answered in [Y]. 

1.3 The EKR's versus sandwich classes. 

What relationship exists between the sandwich class of a given germ of a special 2-flag 
and its all possible EKR presentations? A key tool for answering this question is 

Observation 1. If a distribution D = D r generating a special 2-flag of length r > 1 is 
presented in any EKR form on M. 2r+3 (t, x±, yi, x r+ i, y r +i), then the members of the 
associated subflag in the sandwich diagram for D r are canonically positioned as follows. 

• F = (d/dx 2 , d/dy 2 , d/dx 3 , d/dy 3 , d/dx r+1 , d/dy r+ i), 
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• L(D j ) = ( d/dx j+2 , d/dy j+2 , • • • , d/dx r+1 , d/dy r+ i ) for j < r - 1, 



These extremely simplified descriptions are the analogues of similar ones for Goursat 
flags when viewed in Kumpera-Ruiz coordinates. Another analogue (a derivative product 
of Observation 1) is 

Proposition 1. Assume a germ, D, of a special 2-flag of length r sits in a sandwich class 
having the label S. Then, for any EKR j x . . . j r _i. j r for D, j; = 1 iff the l-th cipher in 
£ is 1. 

Therefore, the singular phenomena - pointwise inclusions in sandwiches do narrow (to 2 
and 3) the pool of operations available at the relevant steps of producing EKR visualisa- 
tions for special 2-flags. 

Proof, ji is by default 1 and the first cipher in S is by definition 1. Consider now j;, 
/ > 2, and recall that the operation j/ transforms certain EKR (Z±, Z 2 , Z 3 ) into an EKR 
(Z[, Z' 2 , Z f 3 ). When j; is either 2 or 3, then, by definition of these operations, Z[ = 
x l+l Z 1 mod(Z 2 , Z 3 ), where Z 2 = £- { and Z 3 = (As for Z' 2 = and Z' 3 = 
they cause no trouble in the discussion.) Whereas for j/ = 1, Z[ = Zi mod (Z 2 , Z 3 ) and 
a non-zero vector -Zi(O) is, by its recursive construction (in I — 1 steps), spanned by 

d/dt, d/dx 1 , d/dy!, d/dxi-x, d/dyi-i. 

Hence, in view of Observation 1, Z^O) does not sit in: F(0) when I = 2, and L(D'" 2 )(0) 
when I > 2. □ 

Remark 1. When k — 1, two operations instead of three (1,2,3) in the present text, 
lead to the well-known local Kumpera-Ruiz pseudo-normal forms for Goursat flags. 

1.4 Singularity classes of special 2-flags refining the sandwich 
classes. 

We refine further the singularities of special 2-flags and recall from [M3] how one passes 
from the sandwich classes to singularity classes. In fact, to any germ J 7 of a special 
2-flag associated is a word W(J r ) over {1,2,3}, called 'singularity class' of J 7 . It is a 
specification of the word 'sandwich class' for J 7 (this last being over, reiterating, {1,2}) 
with the letters 2 replaced either by 2 or 3, in function of the geometry of J 7 . 

In the definition that follows we keep fixed the germ of a rank-3 distribution D at 
p G M, generating on M a special 2-flag J 7 of length r. 

Suppose that in the sandwich class C of D at p there appears somewhere, for the first 
time when going from the left, the letter 2 = j m (j m is, as we know, not the first letter 
in C) and that there are in C other letters 2 = j s , m < s, as well. We will specify each 
such j s to one of the two: 2 or 3. (The specification of the first j m will be made later and 
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will be easy.) Let the nearest 2 standing to the left to j s be 2 = j t , m < t < s. These two 
'neighbouring' letters 2 are separated in C by I = s — t — 1 > letters 1. 

The gist of the construction consists in taking the small flag of precisely original flag's 
member D s , 

D s = V 1 C V 2 C V 3 C V 4 C V 5 C • • • , 

Vi + i = Vi + [D s , Vi], then focusing precisely on this new flag's member V2/+3. Reiterat- 
ing, in the t-th sandwich, there holds the inclusion: F(p) D D 2 (p) when t — 2, or else 
L(D t ~ 2 )(p) D D t (p) when t > 2. This serves as a preparation to an important point. 
Surprisingly perhaps, specifying j s to 3 goes via replacing D 1 by in the relevant 

sandwich inclusion at the reference point. That is to say, j s = 2 is being specified to 3 iff 
F(p) D V 2 i+3(p) (when t = 2) or else L(D t_2 )(p) D V 2 /+3(p) (when t > 2) holds. 

In this way all non-first letters 2 in C are, one independently of another, specified to 2 
or 3. Having that done, one simply replaces the first letter 2 by 2, and altogether obtains 
a word over {1, 2, 3}. It is the singularity class W(J r ) of J 7 at p. Thus created W(J r ) 
clearly satisfies the least upward jumps rule. 

Example 2. In length 4 there exist the following fourteen singularity classes: 1.1.1.1, 
1.1.1.2; 1.1.2.1, 1.1.2.2, 1.1.2.3; 1.2.1.1, 1.2.1.2, 1.2.1.3, 1.2.2.1, 1.2.2.2, 1.2.2.3, 1.2.3.1, 
1.2.3.2, 1.2.3.3. 

Do singularity classes surge to surface in the mentioned local polynomial pseudo- 
normal forms EKR, as the sandwich classes have done? Yes, the EKR's are faithful to the 
underlying local flag's geometry epitomized in the singularity class, and there holds 

Theorem 2 ([M3, M4]). For every germ D of a rank-3 distribution generating a special 
2-flag of length r > 1, and for every its pseudo-normal form of the type jij2 . . -j r (subject 
to the least upward jumps rule), the word j\-3i ■ ■ ■ 3r is but W(D). 
In particular, the singularity class of any EKR form ji. j 2 . . . j r; regardless of its constants, 
is ji-32---3v 

This theorem shows additionally that all defined singularity classes are non-empty. How 
many singularity classes do there exist for special 2-flags, and of what codimensions are 
they? 

On each manifold M of dimension 2r + 3 bearing a special 2-flag of length r, the shadows 
of singularity classes (one says also about materializations of singularities) form always 
- and not only for 'generic' flags! - a very neat stratification by embedded submanifolds 
whose codimensions are directly computable. Namely, 

Proposition 2. The codimension of the materialization of any fixed singularity class C 
is equal, provided the materialization is non-empty, to 

the number of letters 2 in C + twice the number of letters 3 in C . 

Once Theorem 2 shown, one proves this statement locally, using any fixed EKR depicting 
locally the flag in question. 
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The number of different singularity classes of special 2-flags of length r > 3 is 

1 -I- V ^ 1 

2 + 3 + 3 2 + --- + 3 r " 2 = + 2 . (1) 

(One focuses attention on the position of the first letter 2 in the class' code, remembering 
that the codes satisfy the least upward jumps rule: no letter 2 or else that letter at the 
very end - account for the summand 2, that letter at the one before last position accounts 
for the summand 3, and so on. Then that letter at the second position accounts for the 
biggest summand 3 r ~ 2 .) 



1.5 Moduli among parameters in pseudo-normal forms. 

Once the singularity classes (in the present paper - only for 2-flags) and faithful to them 
pseudo-normal forms EKR have been recalled, one of the first imposing questions is that 
about the status of real parameters entering the EKR forms. The same question con- 
cerning parameters in normal forms for germs of 1-flags, sparked by the benchmark work 
[KRui], had remained without answer over a considerable period 1982-97. 

With examples of moduli of 1-flags at hand, it is not long to produce an example of 
an EKR parameter that is a true modulus. To this end, choose the following family of 
EKR's 1.2.1.2.1.2.1 sitting (see Theorem 2) in the singularity class 1.2.1.2.1.2.1: 



dx\ — %2dt 


= 


dyi 


- y2dt 


= 


dt — x^dx2 


= 


dy 2 - 


y 3 dx 2 


= 


(1 + x 4 )dx 2 


= 


dy 3 - 


y 4 dx 2 


= 


dx 2 — x 5 dx 4 


= 


dy 4 - 


y 5 dx 4 


= 


(1 + x e )dx 4 


= 


dy 5 - 


y 6 dx 4 


= 


dx 4 — xjdxQ 


= 


dy% - 


y 7 dx 6 


= 


- (c + x 8 )dx e 


= 


dy 7 - 


ysdx 6 


= 0, 



(2) 



where c 6 1 is an arbitrary real parameter and these objects are considered as germs 
at G M 17 (t, Xi, yi,..., xs, 1/8)- (Due to the Pfafhan equations' description, it is not 
instantly visible that the objects sit in an EKR. Yet, by the time we prove the statement 
in Appendix (Section 8), it will be clear that the proposed objects belong to a concrete 
EKR class of normal forms). The proof is being postponed to keep the exposition balanced. 

Remark 2. (a) The 1-parameter family in (2) is, as it stands, written for the width k — 2 
(there are only two columns of Pfafnan equations). However, a similar family could be 
proposed for any bigger width. The reader can easily figure out the potential 3rd, . . . , 
kth columns, all constructed on the pattern of the second column, with no additional 
constants (the non-zero constants, decisive for the example, always in the first column 
only). The proof for the analogous objects inside the EKR class 1.2.1.2.1.2.1 in the space 
of special fc-flags, k > 2, would be essentially the same, only the basic vector equation 
would be longer and so would be equations on the levels X 5 and X 3 . 
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(b) The germs of special k- flags equivalent to these in (2), or in analogous families for 
k > 2, are thus uni- or more- modal (their modality in Arnold's sense is at least one). 
We suspect that their true modality is either two or three. A lot of work is needed in 
this direction. Already the analysis of the class 1.2.1.2 in section 5.2 indicates possible 
complications. 

Remark 3. We want to note that the problems of local classification of special fc-flags, 
k > 2, (and of 1-flags, too) have important affinities with those of local classifying of 
unparametrized curves in R fc+1 . That is, with the R-L classification of germs of mappings 
R — > R fc+1 , although the two sets of problems are not the same. (In the 1-flags case, which 
stands out by the lack of a canonical analogue of the covariant subdistribution F, we 
mean the local classification of unparametrized contact curves in the contact space R 3 .) 
The first researchers who had gradually (from 1999 onwards) discovered those remarkable 
affinities were Montgomery and Zhitomirskii. From 2003 there has also been an important 
influx of ideas by Ishikawa. Later in section 7.7 we give, with quotations from [GHo, Ar], 
a concrete example of a striking (if only partial) interplay between the two fields. 

1.6 Simple local construction of 2-flags of length 1 and 2. 

Before dealing with the special 2-flags in lengths 3 and 4, we briefly survey the lengths 1 
and 2 in which the bare sandwich classes are the orbits. 

Theorem 3 ([KRub]). (i) Any special 2- flag of length 1 can be locally brought to the 
following particular EKR 1 

dx\ — x 2 dt = dyi — y 2 dt = 

displaying no constants. 

(ii) Any germ of a special 2-flag of length 2 sitting in the generic sandwich class 1.1 can 
be brought to the following EKR 1.1, 

dxi — x 2 dt = dyi — y 2 dt = 

dx 2 — x%dt = dy 2 — y%dt = . 

Every germ of a special 2-flag of length 2 in the sandwich class 1.2 of codimension 1 can 
be written as the following particular 1.2, 

dxi — x 2 dt = dyi — y 2 dt = 

dt — x 3 dx 2 = dy 2 — y^dx 2 = . 

Proof. Lemma 2 and Theorem 1 in [KRub] imply that 1 and 1.1 are single orbits, and 
that 1 exhausts, up to local equivalence, all special 2-flags of length 1. That 1.2 is a single 
orbit and that there are only two orbits in length 2, is explicitly written (albeit without 
proof) on p. 10 8 ~ 10 in [KRub]. Here is a short explanation. 
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It is clear from Theorem 1 that the EKR families 1.1 and 1.2 do cover all orbits in length 
2. We work with the latter family and take into account the simplification coming from 
item (i), thus having the members of 1.2 brought to the form 

dxi — x 2 dt = dyi — y 2 dt = 

dt - x^dx 2 = dy 2 — (c + ys)dx 2 = . 

In order to get rid of c there suffices to just take two new (bar) variables y 2 = y 2 + cx 2 and 
Hi — Vi + cx i- It works because plugging the new expression for y 2 in dy\ — y 2 dt brings in 
the term x 2 dt which is but dx\ due to the first Pfaffian equation in the left column. □ 

2 Strong versus weak nilpotency (in length three) 

It is known since certain time (Theorem 4 in [M2]) that, on top of the Goursat distri- 
butions, also all special /c-flags, and all the more so special 2-flags, are locally nilpoten- 
tizable, or: weakly nilpotent in the actually prevailing terminology. In fact, local bases 
given in the EKR presentations for them are nilpotent, and of nilpotency orders that 
can be effectively computed. On the other hand, only a tiny portion of germs of special 
A;-flags seems to be strongly nilpotent in the precise sense of [AGau] and [Ml]; that is, 
equivalent to their relevant nilpotent approximations. (Nilpotent approximations of distri- 
butions had been investigated by numerous researchers, with outstanding contributions 
[ASa, AGamSa, BiSt, Be]; see also [A] for an important coordinate-free description.) 

This phenomenon has been discovered recently, [Ml], amongst Goursat distributions. 3 In 
the present work it turns out to be of key importance in handling special 2-flags in lengths 
exceeding those of Theorem 3. For, in view of this theorem, the neatest EKR's available 
in these small lengths display no constants. Whence, by the last item of Theorem 4 of 
[M2], 

Observation 2. All germs of special 2-flags in lengths 1 and 2 are strongly nilpotent. 

(As a matter of record, in these lengths, the same is true for special flags of any width k.) 

Among 2-flags of length 3, one singularity class stands out by its complication. It is 
1.2.1, visualised - see Theorem 2 - by the EKR's in the family 1.2.1. Most of the germs 
in 1.2.1 appear not to be strongly nilpotent. In order to see this clearly, we simplify the 
members of the visualising family by means of item (ii) of Theorem 3. That is, write 
constants {b and c in the occurrence) only in the bottommost Pfaffian equations. 

Proposition 3. The germ, D, at G 1R 9 of an EKR 

dx\ — x 2 dt = dy\ — y 2 dt = 

dt — x 3 dx 2 = dy 2 — y^dx 2 = (3) 

dx 3 — {b + x^)dx 2 = dy 3 — (c + y±)dx 2 = . 

3 after a question by Agrachev whether the moduli of the local classification of Goursat objects survived 
the passage to nilpotent approximations 
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is strongly nilpotent iff b = c = 0. 

The remaining of the present chapter is devoted to a proof of this (rather unexpected) fact. 
The basic reference is a highly constructive algorithm from [Be] for computing nilpotent 
approximations; to that algorithm one can often add shortcuts pertinent to objects under 
consideration, as is the case for (3). 

When b = c = 0, the germ is strongly nilpotent by Theorem 4 (last item) of [M2]. 
Assume now (b, c) ^ (0, 0). Under this assumption, upon computing the small flag of D 
at 0, it becomes visible that the small growth vector 4 of D at is 

[3, 5, 7, 8, 9] (4) 

and that an imposing- by-itself collection of linearly adapted, for D at 0, coordinates is 

x 4 , y 4 , x 2 , x 3 - bx 2 , y 3 - cx 2 , t, y 2 , xi, yi . 

The weights (read off from the small vector) attached to these variables are 1, 1, 1, 2, 2, 3, 3, 4, 5, 
respectively; compare the increments in the vector (4). Improving these coordinates to 
adapted (i.e., having non-holonomic orders not only not exceeding, but coinciding with 
the weights) coordinates z±, z 2 , ■ ■ ■ , z 9 , 

zi=x 4 , z 2 = y 4 , z 3 = x 2 ; z 4 = x 3 -bx 2 , z 5 = y 3 - cx 2 ; 

, b 2 c 2 6 3 be 4 

z% = t - -x 2 , z 7 = y 2 - -x 2 ; z 8 = x 1 - -x 2 ; z 9 = y x - —x 2 , 

permits to ascertain the nilpotent approximation D of D. To this end one has to watch 
D = (Z 1: Z 2 , Z 3 ) in these coordinates and extract all the (nilpotent) terms of weight 
— 1 in the Taylor expansions of the vector fields' generators. It is clear that Z 2 = d/dx 4 
becomes now d± and Z% = d/dy 4 becomes d 2 . After more (elementary) computations there 
emerges the new form of the most involved generator Z\ in our EKR, 

Zi = d 3 + zid A + z 2 d 5 + z a 8q + z 5 d 7 + z 3 z 4 d s + (z 4 z 7 + bz 3 z 7 + -z 3 z 4 )d 9 . (5) 

The only non-nilpotent term in all three generators is z 4 z 7 dg in Z\ of weight 2 + 3 — 5 = 0. 
All the remaining terms are of weight —1 and so survive the passing to the nilpotent 
approximation D. Consequently, that latter distribution is spanned by Z 2 = d 1 , Z 3 = d 2 
and by 

Z x = d 3 + z x d A + z 2 d 5 + z 4 d 6 + z 5 d 7 + z 3 z 4 d 8 + (bz 3 z 7 + -z 3 2 z 4 )d 9 . 

At this point D is found, but not yet well understood. In order to analyze it smoothly, 
we pass to other, also adapted for D at 0, variables z±, . . . , z 5 , zq, 

4 the small growth vector of a distribution D at a point p is the sequence of linear dimensions at p of 
the members of the small flag of D 
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z 7 , z 8 , zg, where 

-_ -_ __ 1 2 

ZQ — Zq — Z3Z4 , Z-j — Z-j — Z 3 Z^ , Zg — Zg — —z 3 Z4 , 

— & 2 1 S/7 x 

Z 9 = z 9 - 7^3 Z 7 + 7 Z 3 i bz 5 ~ CZ4) . 
2 O 

In them, the first generator of D becomes tractable, 

Z x = d 3 + z x d A + z 2 d 5 - ziz?,d & - z 2 z 3 d 7 - ^i% 2 <9 8 + -z 3 3 (bz 2 - cz x )d % . 

Now observe that each product of two or more factors from among Z\, Z 2 , Z 3 has no 
components in d±, d 2 , d%, and depends only on zi, z 2 , z 3 , as Z\, Z 2l Z 3 themselves do. 
Therefore, any product of two products of factors from among Z\, Z 2 , Z 3 vanishes. In 
consequence, the big flag of D coincides with the small one. 

Hence the big growth vector of D at coincides with the small growth vector of D at 
0, and the latter is but the small growth vector of D at (the key property of nilpotent 
approximations), that is (4). In this way we know the big vector of D at 0, and find it 
different from the big vector of D at 0, [3,5,7,9]. The germs at of D and D are thus 
non-equivalent. □ 



3 Classification in length three 

Suppose that there is given a special 2-flag germ of length r > 2, generated by a rank- 
3 distribution D = D r , displaying, at the reference point, an inclusion in the second 
sandwich. It was explained in section 1.5 that the locus, say H, of the inclusion F(-) D 
_D 2 (-) is - always and automatically - an embedded codimension-one sub manifold. 

In length r = 3, around any point p of H one can ask if D is transverse or tangent to 
H at p. 

Example 3. At points displaying the basic geometry 1.2.2, D is always transverse to H, 
whereas at the 1.2.3 points it is tangent to H. The reason becomes visible in any EKR 
glasses: around any 1.2.2 point, the generator Z\ has the bare component d/dx 3 , whereas 
around any 1.2.3 point that generator has the component y^djdx 3 that vanishes at 0. 
This observation offers, besides, an alternative (and very simple) way of specifying the 
second letter 2 in the sandwich class 1.2.2. (And more widely, in any class with a pair of 
neighbouring 2's in the code, concerning the refining of the second 2 in the pair.) This 
way, however, does not lend itself to full scale generalizations, while the way recapitulated 
in section 1.4 is universal. 

Let us ask this question at points having the geometry 1.2.1. As we know already, the 
proper visualisation around these points is the pseudo-normal form (3) in which b, c are 
certain, a priori unknown parameters. 
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Observation 3. Assume that the flag of D has at p the geometry 1.2.1 and that H 3 p 
is the hypersurface of the inclusion in the second sandwich. Then D is tangent to H at p 
if and only if b = in any visualisation (3) for D around p. 

In order to prove this one recalls that then H = {x 3 = 0}, while Z\ has the component 
{b + X4)d/px3 taking at the value bd/dx 3 . □ 

In view of this observation, the singularity class 1.2.1 splits into two invariant parts, 
transverse and tangent. Independently, it splits (Proposition 3) into two other invariant 
parts depending on the strong nilpotency holding true or not. Moreover, the latter prop- 
erty (b = c = in the glasses) implies the tangency (b = in the glasses). The class 1.2.1 
thus splits into three invariant parts 

• 1.2.1_ Sitra — germs in 1.2.1 not strongly nilpotent and transverse, 

• 1.2.1_ Sjtan — germs in 1.2.1 not strongly nilpotent and tangent, 

• 1.2.1 +s — the strongly nilpotent germs in 1.2.1. 

We are now in a position to locally classify the special 2-flags of length three. 

Theorem 4. In length three there exist altogether 7 orbits of the local classification of 
special 2-flags. The singularity classes 1.1.1, 1.1.2, 1.2.2, 1.2.3 of codimensions, resp., 
0, 1, 2, 3, are single orbits with the normal EKR forms 1.1.1, 1.1.2, 1.2.2, 1.2.3 with 
all constants zero as respective local models. 

The three invariant parts of the singularity class 1.2.1 of codimension 1 are orbits, too. 
In fact, all members of 1.2.1_ s tra are equivalent to 



dx\ — x 2 dt 
dt — x^dx2 
dx3 — (1 + X^)dX2 















dyi - y 2 dt 
dy 2 - yzdx 2 
dy 3 - y 4 dx 2 











and this orbit has codimension one. 

All members of 1.2.1_ s>ta ii are equivalent to 



dx\ — x 2 dt = 
dt — x 3 dx 2 = 
dx 3 — x^dx2 = 



dy 3 



dyi - y 2 dt 
dy 2 - y 3 dx 2 
(1 + y 4 )dx2 











and the orbit' codimension is two. 



All members of 1.2. l +s are equivalent to the germ at G M 9 of 



dx\ — x 2 dt = 
dt — x 3 dx 2 = 
dx 3 — x A dx2 = 



dyi - y 2 dt = 
dy 2 - Vzdx 2 = 
dyz - ydx 2 = , 



and the codimension of this orbit is three. 
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Corollary 1. Strongly nilpotent germs of special 2-flags in length three are all those in: 
the first four and the last (seventh), orbits in the above theorem. 



(As regards the first four orbits, it is so in view of Theorem 4, last item, in [M2]. At this 
point, however, it should be noted that in [M2] the families like 1.1.2 or 1.2.2 were not 
yet ultimately simplified, cf. p. 169 there.) 

Corollary 2. It follows from the contents of Section 2 that the germs in different orbits 
1.2.1„ stra and 1.2. l_ s , tan have at the reference points the same s.gr.v. (4). Thus, for 
special 2-flags, the small growth vector does not discern all orbits of the local classification 
already in length three. (For G our sat flags the smallest such length is seven.) 

Proof of Theorem 4. Concerning 1.1.1, it is again Theorem 1 of [KRub]. Concerning 1.1.2, 
one can, without loss of generality, work with the following EKR's, 

dx\ — x 2 dt = dyi — y 2 dt = 

dx2 — x%dt = dy2 — y%dt = 

dt — x^dxs = dy 3 — (c + y^dx^ = . 

As in the class 1.2, it is natural to venture y% = y 3 + cx%. Then this expression for y 3 
plugged in to dy 2 — yj,dt sparks a simplification, because x%dt = dx 2 . Thus y 2 = y 2 + cx 2 
is needed. And this y 2 substituted to dy\ — y 2 dt causes again a simplification due to 
x 2 dt = dxi, and yi =Vi + cx\ is needed to conclude. 

As regards 1.2.2, without loss of generality one can work with the following EKR's, 

dx\ — x 2 dt = dyi — y 2 dt = 

dt — x 3 dx 2 = dy 2 — y%dx 2 = (6) 

dx 2 — x^dxz = dys — (c + y^dx^ = , 

trying to reduce to the constant c. The technique is similar to that employed for the 
previous class. One starts with y 3 = y 3 + cx 3 , then spots x 3 dx 2 = dt holding true in the 
system (6) and takes y 2 = y 2 + ct, after which concludes with y\ = y 1 + ft 2 . 

As for 1.2.3, no work is needed after previous simplifications in length two, and the 
local model 

dx\ — x 2 dt = dyi — y 2 dt = 

dt — x 3 dx 2 = dy 2 — y^dx 2 = 

dx 2 — x^dyi = dxs — y^dy^ = , 

follows. Also the part 1.2.1 +s of 1.2.1, after Proposition 3, needs no work, and the same 
applies to the part 1.2.1_ s tan : in the pseudo-normal form (3) there must hold b = 0, c ^ 0, 
and such c is easily normalizable to 1. 

There remains the part 1.2.1_ s tra of 1.2.1 when, in (3), 6^0 and c is arbitrary. 
One can assume (by means of passing to the bar variables x 4 = 6x 4 , x 3 = bx 3 , t = bt, 



14 



x\ = bxi, yi = by x ) that 6 = 1. Then starts as in previous cases with y s = y 3 + cx^. But 
dxs = (1 + xi)dx2 in the Pfaffian system in question. Hence 

dy3 - (c + y±)dx2 = dy 3 + c(l + x A )dx 2 - (c + 2/4)^2 = #3 - (?/4 - cx A )dx 2 ■ 

Now it imposes by itself to write j/ 4 = y A + CX4, continue with 2/2 = 1/2 + an d conclude 
with yi — y x + ft 2 . Theorem 4 is proved. □ 

Remark 4. The local classifications of special /c-flags, k > 2, of lengths not exceeding 
three coincide with those in width two, fc = 2. In particular, Theorem 4 directly general- 
izes: there are always 7 orbits (four of them being singularity classes and the remaining 
three building up the class 1.2.1) having the same characterizations as in width two. 
In other words, the classifications in lengths not exceeding three are stable with respect 
to width k > 2. 

4 Classification in length four — simpler part 

The question that stands now is about the orbits sitting inside the fourteen singularity 
classes listed in Example 1. We start with with an elementary 

Theorem 5. In length four, only the following 6 singularity classes of germs of special 
2-flags (out of altogether 14 existing in that length) are single orbits of the local classifica- 
tion: 1.1.1.1, 1.1.1.2, 1.1.2.2, 1.1.2.3, 1.2.2.3, 1.2.3.3. As unique local models there can be 
taken, respectively, the EKR's 1.1.1.1, 1.1.1.2, 1.1.2.2, 1.1.2.3, 1.2.2.3, 1.2.3.3 with 
all constants appearing in them equal to 0. In consequence, all these classes are strongly 
nilpotent. 

Proofs for these classes go entirely analogously to those in length three concerning the 
classes 1.1.1, 1.1.2, 1.2.2, and 1.2.3; only the chains of consecutive passings from variables 
ytoy are sometimes longer by one step. 

Nextly we group together four singularity classes that split (each of them) into no 
more than three orbits. 

Theorem 6. In length four, the classes 1.2.2.2 and 1.2.3.2 consist of two orbits each. 
Whereas the classes 1.1.2.1 and 1.2.1.3 consist of three orbits each. The codimensions of 
orbits, and local models, are listed in the proof. 

4.1 Proof for 1.1.2.1 — the strong nilpotency at work. 

The method for 1.1.2.1 is analogous to that for the class 1.2.1, and 1.1.2.1 splits into: 
1.1.2.1 +s - strongly nilpotent germs (an orbit of codimension three), 1.1.2.1_ Sjtra (a generic 
part in 1.1.2.1 and an orbit of codimension one equal to the codimension of the class) - 
germs not strongly nilpotent and transverse to the locus H of hitting the sandwich class 
1.1.2.1, and 1.1.2.1_ s ta n (an orbit of codimension two) - germs not strongly nilpotent and 
tangent to H at the reference point. 
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When searching for local models in 1.1.2.1, the unique local model for 1.1.2, 

dxi — x 2 dt = dyi — y 2 dt = 

dx 2 — x 3 dt = dy 2 — y 3 dt = 

dt — x^dxz = dy 3 — y 4 dx 3 = , 

is to be extended by a couple of equations 

dx 4 — (B + x 5 )dx 3 = dy 4 — (C + y 5 )dx 3 = 

with two new parameters B and C. For a representative of the strong nilpotency part 
1.1.2. l +s , we take B — C — 0. The proof that the complement in 1.1.2.1 of the germs 
equivalent to the particular 1.1.2.1 with B = C = 0, consists of not strongly nilpotent 
germs, now splits into two separate proofs, according to two highly different underlying 
geometries. (For 1.2.1, off the scope of strong nilpotency, there were also two different 
geometries, yet they displayed one and the same small growth vector, cf. Corollary 2, and 
could have been subsumed into one computation.) 

Namely, the 'sandwich^ locus H has now the equation x 4 = 0,jmd the germs with 
B are transversal to H, while those with B = are tangent to if at (our reference 
point). In the normal forms for transversal ones, the constant C can be easily reduced to 
(as in the handling of 1.2.1_ s tra in the proof of Thm. 4). A local model for 1.1.2.1_ s tra 
is obtained by further normalizing B to 1. In the normal forms for tangent germs, C can 
be reduced to 1, yielding a model for 1.1.2.1_ s tan . 

• All germs in 1.1.2.1_ Sjtra are not strongly nilpotent: 

A careful computation shows that, independently of a germ in 1.1.2.1_ s tra , the departure 
point - the small growth vector at the reference point - is [3,5,7,8,9,10,11]. We work 
with C already annihilated and B ^ - and improve the starting EKR coordinates to 
linearly adapted 

%5, 2/5, x 3 , x 4 -Bx 3 , y 4 , t, 2/3, x 2 , y 2 , x u y x . 

These coordinate functions are not yet adapted (the attached weights, read off from the 
small vector, are 1, 1, 1, 2, 2, 3, 3, 4, 5, 6, 7, while the non-holonomic orders of functions are, 
in some cases, smaller). Improving them further, by Bellaiche adopted to the situation, 
yields (certain) adapted coordinates 

B 2 

zi = x 5 , z 2 = 2/5, z 3 = x 3 ; z A = x A - Bx 3 , z 5 = y 4 ; z 6 = t- —x 3 , 

z 7 = y 3 ; z 8 =x 2 - ^-xf ; z 9 = y 2 ; z w = x x - ^-zf ; z u = y x 

in which the nilpotent approximation can be distracted already. Namely, Z 2 = d/dx 5 
becomes now d\ and Z 3 = d/dy$ becomes d 2 , while Z\ assumes the form 

Z\ = d 3 + zid 4 + z 2 d 5 + z 4 d 6 + z 5 d 7 + z 3 z 4 d 8 + ( z 4 z 7 + Bz 3 z 7 )d 9 

, B \ 

+ (z^zs + Bz 3 z 8 + —z 3 z 4 )d 10 + ( z 4 z 9 + Bz 3 z g )d n . 
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The underlined terms are of degree 0, the remaining are of degree —1. Recalling, passing 
from a distribution to its nilpotent approximation consists in leaving out all the terms, 
in arbitrary adapted coordinates, of degrees exceeding —1. In the present case, thus, D is 
generated by Z 2 = Z 2 , Z 3 = Z 3 and by 

Zi = d 3 + z x d 4 + z 2 d 5 + z 4 d 6 + z b d 7 + z 3 z 4 d 8 (7) 
+ Bz 3 z 7 d 9 + (Bz 3 z 8 + —z 3 z 4 )d w + Bz 3 z 9 d u . 

Similarly as working earlier with 1.2.1, through (7) one does not see the properties of D. 
Hence seeks coordinates that are more adapted. After a careful search, zi, . . . , z$ and 

-_ -_ 1 2 

z 6 — Zq — Z 3 Z 4 , Z 7 — Z 7 — Z 3 Z$ , Z 8 — Z 8 — -Z 3 Z 4 , 

— B 2 B 3l B 2 B 4 

Z9 — Zg — —Z 3 Z 7 + — Z 3 OZ5 , Z10 — Zio — —Z 3 Z 8 + ~^ z 3 z i i 

R R2 R2 
D 2 -D 4 D g 

^11 = ^11 - —^3 ^9 + "g"^ ^7 - ~^ Z 3 Z 5 > 

are such that Z 2 and Z 3 remain unchanged, while Zi assumes the form 

Z\ = d 3 + 2194+2295 - 2i2 3 (9 6 - z 2 z 3 d 7 - ^z ± z 3 d 8 (8) 

+ ~^z 2 z 3 d 9 + — 2i2 3 4 9i - — z 2 z 3 d u . 

That is to say, the components $4 through dn in the fields Zi, % = 1, 2, 3 spanning 
D depend now only on Z\, z 2 , z 3 , while the (d±, d 2 , <9 3 )-components are constant. This 
implies, like earlier in the proof of Proposition 3, the coincidence of the small and big 
growth vectors of D at the reference point 0. In consequence, the germs at 0, D and D, 
have different big growth vectors, hence are non-equivalent. 

•• All germs in 1.1.2.1_ s t an are not strongly nilpotent: 
We begin with a remark that a new proof is needed for this part because the small growth 
vector governing 1.1.2.1_ Sjta n is different from that servicing 1.1.2.1_ s tra . In fact, after a 
delicate computation, it is [3, 5, 7, 8, 9, 9, 10, ll]. 5 Now B = in the pseudo-normal forms, 
and we purposedly keep a general C 7^ 0. The argument evolves, again, stepwise. Firstly 
one passes from the EKR coordinates to linearly adapted 

x 5 , y 5 , x 3 , x 4 , y 4 - Cx 3 , t, y 3 , x 2 , y 2 , x 1 , y x , 

whose weights are now 1, 1, 1, 2, 2, 3, 3, 4, 5, 7, 8, respectively. It appears that, among them, 
only y 3 is not adapted: its non-holonomic order is 2, and weight 3; it suffices to improve 
it to y 3 — *jx 3 . In this way there emerges a set of adapted coordinates 

z\ — X5 , z 2 = 1/5 , z 3 = x 3 ; z 4 = x 4 , z§ = y 4 — Cx 3 ; z% = t, 



5 it was not so in length three with 1.2.1_ Si t ra and 1.2.1_ Si tan! This kind of complications, when the 
length grows, is typical in special 2-flags. 
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C 2 

Z7 = U3- ; z 8 = x 2 ; z 9 = y 2 ; z 10 = x x ; z u = y x . 
In these coordinates Z 2 — d±, Z 3 = d 2 , and 

C 

Zi = d 3 + zi d 4 + z 2 d 5 + z 4 d 6 + z 5 d 7 + z 3 z 4 d 8 + ( z 4 z 7 + — z 3 z 4 )d g + z 4 z 8 d 10 + z 4 z 9 d n . 

The nilpotent approximation [Z\, Z 2 , Z 3 ) is obtained by only leaving out this one under- 
lined term z 4 z 7 dg of degree in Z x . That is, Z 2 — di, Z 3 = d 2 , and 

^ C 

Z\ = d 3 + z x d 4 + z 2 d 5 + z 4 d 6 + z 5 d 7 + z 3 z 4 d 8 + — z 3 2 z 4 d 9 + z 4 z 8 d 10 + z 4 z 9 d n . (9) 

As often in flags, nothing is visible in such Bellaiche-like vector field (9) save that it seems 
not possible that just leaving out the above single term results in the slowing down of the 
big vector, at the reference point, from [3, 5, 7, 9, 11] (for D) to [3, 5, 7, 8, 9, 9, 10, 11] (for 
D). But this is the case! To see this, it suffices to improve the adapted coordinates to 

1 2 

Z Q — Zq — Z 3 Z 4 , Z-j — Z-j — Z 3 Z<a , z 8 — z 8 — —z 3 z 4 , 

C 3 1 2 ^32 ^ 4 

z 9 = Zg — —Z 3 Z 4 , Zio = Zio — Z 3 Z 4 Z 8 + ~^ Z \ Z Z Z S + 7^3 Z A — 7^ Z ^ Z 3 Z 4 ' 

1 2 C 4 2 ^* 5 

Zn = Z\\ — Z 3 Z 4 Zg + —Z\Z 3 Zg + — Z 3 Z A — 777^1^3 z i ■ 

z o 1U 

In these [more sophisticated] zi, . . . , Z5, zE, . . . , zu, the involved generator (9) becomes 
but 

Zi = d 3 + z\d 4 + z 2 d^ — z 1 z 3 d e — z 2 z 3 d 7 — -Ziz 3 d 8 — —Ziz 3 d 9 — -^z%z 3 d w — —z^z 3 dn . 

And the reader knows already that such an expression, using only z±, z 2 , z 3 in components, 
guarantees that the big and small vectors of D, and hence the small of D, all coincide. 
Thus D is far from being equivalent to D. 



4.2 1.2.1.3. 

Concerning 1.2.1.3, the previous discussion of 1.2.1 applies to the Lie squares of members 
of this class, while the prolongation to length four leaves no freedom on the level of EKR 
pseudo-normal forms, because the last 3 in the code corresponds to the prolongation 
pattern 3 that brings in no new parameters. In fact, a distribution now being denoted D A 
and its [factored out] Lie square D 3 /L(D 3 ) being a distribution germ sitting in the class 
1.2.1, 1.2.1.3 is split up according to the local geometry of D 3 /L(D 3 ): of the type 1.2. l +s , 
or 1.2.1_ Sj tra, or else 1.2.1_ S)tan . In either case the relevant local model for D 3 /L(D 3 ) is 
being extended by one precise pair of Pfaffian equations 

dx 2 — x^dy 4 = dx 4 — y$dy 4 = . 
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4.3 Proof for the classes 1.2.2.2 and 1.2.3.2 . 



It turns out that the germs of special 2-flags sitting in 1.2.2.2 are either strongly nilpotent, 
1.2.2.2 +s , and then equivalent to the EKR 

dx\ — x 2 dt = dyi — y 2 dt = 

dt — x 3 dx 2 = dy 2 — yzdxi = 

dx 2 — x 4 dx 3 = dy 3 — y 4 dx 3 = (10) 

dx 3 — x 5 dx 4 = dy 4 — y^dx 4 = 

(this orbit is of codimension four - its materialization has, for the object (10), local 

equations x 3 = x 4 = x 5 = y 5 = 0), or else not strongly nilpotent, and in that case 
equivalent to 

dx\ — x 2 dt = dyi — y 2 dt = 

dt - x 3 dx 2 = dy 2 - y 3 dx 2 = 

dx 2 — x±dx 3 = dy 3 — y^dx 3 = (11) 

dx 3 — x<~,dxi = dyi — (1 + y$)dxi = 

(this is the generic orbit of codimension three; its materialization has, for the object (11), 
local equations x 3 = x^ = x$ = 0). 

In fact, to show that the orbit of (11) consists of not strongly nilpotent germs is rather 
lengthy; instead, we are going to demonstrate (what is enough for theorem) the non- 
equivalence to the strong nilpotency part 1.2.2.2 +s . 

Indeed, suppose that the object (10) is equivalent, as the germ at G R 11 , to an EKR 
like (11), with a constant C in the place of 1 in the last Pfaffian equation there. That is, 
suppose the existence of a conjugating diffeomorphism 

$ = (T, X u Yi, X 2 , Y 2 , X 3 , Y 3 , X 4 , Y 4 , X 5 , Y 5 ) : (R 11 , 0) ^ 

(note its preserving of 0, for only the germs at are being discussed). The aim is to show 
that C — 0. Similar situations of hypothetical conjugacies between different EKR's will 
frequently occur later. Because of that it is important to carefully describe restrictions 
such $ (and several other conjugacies appearing later in the paper) is subject to. First of 
all, the EKR's that are conjugated have, by Observation 1, the same nicely positioned 
subflag of associated involutive subdistributions 

F D L(D 1 ) D L(D 2 ) D L(D 3 ) D L(D 4 ) = 

which must be preserved by $. It implies that 

• T, Xi, Y\ depend only on t, x±, y\ , 

• for 2 < j < 4, functions Xj, Yj depend only on t, x±, y±, . . . , Xj, j. 
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In turn, it will momentarily become visible that - in the discussed situation - one knows 
even more about the components X 3 , X 4 , and X 5 . 

Indeed, whenever there happens - as in our case - an inclusion in the second sandwich, 
F(Q) D D 2 (0), of the sandwich diagram for D given by (10) or by (11), it happens not at 
isolated points like but in codimension one. For, in any EKR for D in the vicinity 
of 0, taking again into account Observation 1, the locus of the inclusion F(-) D D 2 (-) 
has the equation x 3 = 0. Similar remarks apply to the inclusions in the third and fourth 
sandwiches, L(D 1 )(-) D D 3 (-) and L{D 2 ){-) D D\-). 

Therefore, both flags have the same singularity loci of the inclusions holding true in the 
indicated sandwiches, and these loci locally are but the hyperplanes {x 3 = 0}, {x 4 = 0}, 
and {x$ = 0}. The mapping $ preserves these, meaning that its relevant components are 
divisible, as function germs, by x 3 , x 4 , X5, respectively. I.e., that there exist invertible at 
functions K, H, G, also only depending on the variables specified above and satisfying 

• X 3 (t, x u . . . , y 3 ) = x 3 K(t, x x , . . . , x 3 , y 3 ) , 

• X 4 (t, X!,..., y A ) = x 4 H(t, xi, . . . , x 4 , 2/4) , 

• X 5 (t, xi, . . . , y 5 ) = x 5 G(t, xi, . . . , x 5 , y 5 ) , 

(letters are taken in this order because of the subsequent nestings x^G — > x 4 H — > x 3 K 
in (12)). Proceeding in our arguments, let us reiterate that $ preserves the distribu- 
tion (<9/<9x 5 , d/dy 5 ) (which in both cases is L(D 3 )). In consequence there must exist an 
invertible at function /, / | 7^ 0, such that 



( 



X3\ 



( 



x 5 G 




;i2) 



where the *'s are functions whose nature is irrelevant for the argumentation. In (12), 
for brevity, p stands for (t, xi, yi, . . . , x 5 , y 5 ). The first conclusion from this rich set of 
conditions is 

|^|o=C/| , (13) 

after which one looks for an information on Y 4 . The 7-th row of (12), after dividing it 
sidewise by x$, gives an expression for fGY 4 in terms of Y 3 which in turn implies 

dY 4 , dY 3 , 



fG 



Oxa 



— 



dx-. 



1 j 



(14) 
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after which one looks for an information on Y 3 . And the 5-th row of (12), after dividing 
it sidewise by X5X4, supplies an expression for fGHY 3 in terms of Y 2 . That expression 
implies, among others, that 



dx 

and 



dY 2 , 





fGHY 3 \ = (15) 



^\o=fGH^\ . (16) 
0x2 ox 2 

One predicts already that, after dividing it sidewise by x§x 4 x 3 , the 3-rd row of (12) yields 
an expression for fGHKY 2 in terms of Y]_. It is crucial that that expression is affine in X2 
- its second derivative wrt x 2 vanishes identically. At the same time that second derivative 
at is equal to 

2 q 7; r jGHK -—T- - jGHK — — ^ 

OX2 OX2 ox 2 OX2 

(the last equality in view of (15) ). Therefore, the LHS, and hence also the RHS of (16) 
vanishes. Now (14) and (13) directly infer C — 0. So, indeed, the zero and non-zero values 
of C are not equivalent. On the other hand, any non-zero value can be easily rescaled to 
the value 1 - all such germs are equivalent to (11). The class 1.2.2.2 is settled. 

As for the class 1.2.3.2, its members are either strongly nilpotent and equivalent to 

dx\ — x 2 dt = dyi — y 2 dt = 

dt - x 3 dx 2 = dy 2 - y 3 dx 2 = 

dx 2 - x 4 dy 3 = dx 3 - y±dy 3 = 

dy 3 — x^dxi = dyi — y^dx^ = , 

building up the orbit 1.2.3.2 +s of codimension five (with local equations of its material- 
ization being x 3 = rr 4 = y 4 = x$ = y$ = 0), or else not strongly nilpotent and equivalent 
to 

dx\ — x 2 dt = dyi — y 2 dt = 

dt — x 3 dx 2 = dy 2 — y 3 dx 2 = 

dx 2 - x A dy 3 = dx 3 - y A dy 3 = 

dy 3 - x 5 dx 4 = dy 4 - (1 + 2/5)^4 = , 

building up the generic orbit 1.2.3.2_ s of codimension four (with local equations x 3 = x 4 = 
y 4 — £5 = 0). The proof of the non-equivalence of these two representatives is analogous 
(and simpler) than that servicing the class 1.2.2.2; the lack of the strong nilpotency within 
the second orbit is, however, even harder to show. 
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5 Classification in length four — harder part 



It is still more surprising that 

Theorem 7. The singularity classes 1.2.1.2, 1.2.2.1 and 1.2.3.1 split into four orbits 
each. The codimensions and local models are given below in the proofs. 



5.1 Proof for the class 1.2.2.1 



As previously, the Lie square of a distribution germ, factored out by its Cauchy charac- 
teristics sits in 1.2.2 whose unique local model is known. So one can take those Pfaffian 
equations 



dxi — x 2 dt = 
dt — x 3 dx 2 = 
dx2 — x 4 dx 3 = 

and add to them a couple of new ones, 

dx4 — (B + x$)dx 3 = 



dyi - V2dt = 
dy 2 - Vzdx 2 = 
dy 3 - yidx 3 = , 



dy 4 - (C + y 5 )dx 3 = 



with unknown parameters B and C. The situations 5^0 and B = are geometrically 
different, and hence non-equivalent: the vanishing of B means precisely the tangency 
of a distribution at the reference point (here 0) to the locus of the inclusion in the 3-rd 
sandwich. Moreover, in the transverse case B ^ it is easy to normalize B to 1. Assuming 
this done already, now instead of B we have a discrete parameter e G {0, 1} that bears a 
geometric meaning: e = 1 is transversality, e = - tangency. And, keeping e constant, we 
try to conjugate, via a preserving the origin diffeomorphism $ = (T, Xi, Y±, . . . , X$, Y5) 
of M 11 into itself, the two relevant EKR's: with C = and C / 0. This boils down, as in 
the discussion in Section 4.3, to the vector equation 



d$(p) 




f(p) 



X4G 



xsHl X 2 
\Y 2 



V 



1 

Y 3 
1 

Y 4 

e + X 5 
C + Y 5 



(17) 



where /(0) 7^ and now only X 3 = x 3 H, X 4 
holding only in 2nd and 3rd sandwich). 



x 4 G are of such special form (inclusions 
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The 9-th row in (17), evaluated at 0, reads 



dY > +e a Ji U=C fU. (18) 



8x 3 

The 7-th row in (17) gives /I4 in function of Y 3 , which implies 



dY 3 

= /y 4 |o = tt^Io, (19) 



dx 



3 



<9F 4 J _ <9 2 r 3 

&C3 cte 

and 



^l»=M l0 ' (20) 



f aFl I - I (21) 
ax 4 ot 2 

In a cascade of arguments, the 5-th row of (17), after dividing it sidewise by re 4 , yields an 
expression for fGY 3 , in terms of Y 2 , which is affine in x 3 . Hence its second derivative wrt 
x 3 vanishes, and in particular 

(in view of (19) ). Now this equality (22) together with (20) show that the first summand 
on the LHS in (18) vanishes. Passing to the second summand, that mentioned above 
expression for fGY 3 implies not only (22) but also 

BY 

= fGY 3 \ = — ^| (23) 
ox 2 

and 

fG m I - ^ I (24) 

And this last equality, via (21), reduces the handling of the term |^ | in (18) to the 
second derivative at of Y 2 with respect to x 2 . 

Continuing the cascade, it is the 3-rd row in (17) which, after dividing it sidewise by £4X3, 
gives an affine in x 2 expression for fGHY 2 . That expression, doubly differentiated wrt x 2 
to an identical zero, implies 

_ dJJGH) dY 2 d*Y 2 d 2 Y 2 

u - 1 s 7^ 1" jGH ——j- I - Jtjti I 

ox 2 ox 2 ox 2 ox 2 

(the last equality by (23) ). The needed derivative turns out to be zero, and so is the LHS, 
hence also RHS, of (18). We have shown that C = 0. Thus, for either of the two values of 
e, the zero and non-zero values of C are shown to be non-equivalent. On the other hand, 
a non-zero C is easily normalizable to 1. So the class 1.2.2.1 splits into four orbits having 
for local models the relevant EKR's with the constants 
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• B = 


1 


c = 


1 


(the generic orbit of co dimension two), 


• D = 


-1 


L> = 


u 


(an orbit of codimension three), 


• B = 


o, 


c = 


1 


(an orbit of codimension three), 


• B = 


o, 


c = 





(the strongly nilpotent part of codimension four) 



The last orbit should be denoted by 1.2.2. l +s , but it is long to show that the remaining 
orbits contain only not strongly nilpotent distribution germs. 



Passing to the singularity class 1.2.3.1, the orbits sitting inside it have [superficially] much 
similar description to those inside 1.2.2.1. We mean the equations for the class 1.2.3, 



for the square of a distribution under consideration, extended by the pair of equations 
pertinent to the (last) cipher 1 in the code 1.2.3.1 , 



in which, naturally, one has to normalize the constants whenever possible. Every such 
EKR sits in the sandwich class 1.2.2 and so the inclusions at the reference point hold 
in both the 2-nd and 3-rd sandwich. The loci of them are {0:3 = 0} and {x^ = 0}, 
independently of the values of B and C. The distribution represented by a given pair of 
values is tangent at to the latter locus if and only if B = 0. One can quickly inspect this 
tangent situation in purely geometric terms. Namely, for each of the EKR's in question 
the locus of the singularity class 1.2.3 (for the Lie square) is {x 3 = X4 = 1/4 = 0}. In 
the tangent situation B = 0, it is natural to ask the question whether the distribution 
is tangent, at the reference point 0, to this locus. And it is iff C — 0. Hence the germs 
in 1.2.3.1 equivalent to the EKR with B = C = are simultaneously tangent to the two 
singularity loci: of 1.2.2 and 1.2.3. Whereas those equivalent to an EKR with B = 0, 
are tangent to the locus of the inclusion D 3 C L(D r ), but not to the locus of more 
fine geometry 1.2.3. 

In the transvese case, it is straightforward to normalize B to 1, after which there pops 
up the question of the relevance of C. So we try, exactly as for 1.2.2.1, to conjugate, by 
means of a diffeomorphism $, the zero value with a non-zero C. The mentioned loci have, 
of course, to be preserved by $ = (T, X±, . . . , F 5 ), whence the components X 3 and X 4 of 
$ are of special form, X 4 = x A G and X 3 = x 3 H; G, H invertible at 0. Moreover, there 



5.2 The discussion of 1.2.3.1 and 1.2.1.2 . 



dxi — x^dt = 
dt — x 3 dx 2 = 
dx 2 — X4dy 3 = 



dyi - y 2 dt = 
dy 2 - yzdx 2 = 
dx 3 - y 4 dy 3 = , 



dx 4 — (B + x 5 )dy 3 = 



dy4 - (C + y 5 )dy 3 = 
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must hold 



X4 





( 1 


[ 

X 3 \ 


^2 




V2/2 




1 




2/3 




2/4 




1 


1 - 


1- x 5 


2/5 














(25) 



with an invertible at factor function /. We will use this set of conditions as modestly 
as possible. The main relation, implied by the 9-th row in (25), reads 



dY 4 dY 4 
— - H 

dy 3 dx A 



C/|o • 



(26) 



It will momentarily turn out that both summands on the left disappear. Indeed, for either 
of the EKR's the locus of the singularity class 1.2.3 (for the Lie square) is {0:3 = x 4 = 
y 4 = 0}. This set has, therefore, to be preserved by $. Consequently, 

I4 e {x 3 , x 4 , y 4 ) , 

the ideal of functions' germs generated by the listed generators. Thus the first summand 
on the LHS of (26) vanishes. Passing to the second one, let us call simply Z the vector 
field in (25) to which rf$ is being applied. Then the 6-th row in (25) says that 



/y 4 = ZX 3 = Z(x 3 H) = y 4 H + x 3 ZH E {x 3 , y 4 ) . 

Therefore, I = 0, implying the vanishing of the second summand on the LHS in 

(26). In the transverse case the non-zero values of C are not equivalent to the zero value. 
At the same time, the non-zero values of C are readily normalizable to 1, and so the list 
of local models for 1.2.3.1 reads, formally as for 1.2.2.1, 



• 5 = 1, C = 1 

• B = l, C = 

• B = 0, C=l 

• B = 0, C = 



transverse generic, 
transverse atypical, 

tangent to '1.2.2', but not tangent to '1.2.3', 

tangent to both '1.2.2' and '1.2.3', or: strongly nilpotent. 



As regards the class 1.2.1.2, it is reasonable to split the analysis into two cases. Either 

• the square of a distribution - the suspension of a 1.2.1 germ - is tangent at the 
reference point to the locus of the singularity 1.2.1, 
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or else 

•• the square of a distribution is transverse at the reference point to the locus of the 
singularity 1.2.1. 

Surprisingly, it is the • case that is easy. Indeed, by our earlier Theorem 4 (in its part 
concerning 1.2.1), the first three pairs of equations are then simplified as follows, 

dxi — x 2 dt = dyi — y 2 dt = 

dt - x 3 dx 2 = dy 2 — y 3 dx 2 = (27) 

dx 3 — x^dx 2 = dy 3 — (e + yi)dx 2 = , 

with e being either 1 (when the square is not strongly nilpotent) or (the square strongly 
nilpotent), while the last pair 

dx 2 — x 5 dx4 = dy^ — (c + ys)dx± = 

is open to further simplification. We mean the standard way y 4 = y A + cx^ y 3 = y 3 + cx 3 , 
2/2 = V 2 + ct, yi = y x + |t 2 . This transformation, irrespectively of the value of e, annihilates 
the constant c, because in the Pfaffian system (27) there hold the simplifying relations 
x^dx 2 = dx 3 and x 3 dx 2 = dt . Therefore, the • case represents but two orbits: 

1.2.1_ Bj tan-2 written down as 

dxi — x 2 dt = dyi — y 2 dt = 

dt - x 3 dx 2 = dy 2 - y 3 dx 2 = 

dx 3 - x A dx 2 = dy 3 — (1 + yi)dx 2 = 

dx 2 — x 5 dx^ = dyi — y^dx^ = , 

and the part, 1.2.1.2 +s , that is strongly nilpotent in 1.2.1.2, 

dx 1 — x 2 dt = dyi — y 2 dt = 

dt - x 3 dx 2 = dy 2 - y 3 dx 2 = 

dx 3 — x^dx 2 = dy 3 — y 4 dx 2 = 

dx 2 — x 5 dx4 = — y^dx^ = , 

As regards the •• case, by Theorem 4 for 1.2.1 again, the first three pairs of equations 
can be simplified to 

dx\ — x 2 dt = dyi — y 2 dt = 

dt — x 3 dx 2 = dy 2 — y 3 dx 2 = (28) 

dx 3 — (1 + x±)dx 2 = dy 3 — y^dx 2 = , 

while the last pair is, for the moment, general 

dx 2 — x 5 dx4 = dy4 — (C + y^dx^ = . (29) 
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We will show that the two situations C = and C ^ in (28) - (29) are non-equivalent. 
To this end, we suppose the existence of a local conjugating diffeomorphism 



$ = (T, X u Y u X 2 , Y 2 , X 3 , Y 3 , X 4 , Y 4 , ^5, n) = (K 11 , 0) 
sending the object with the zero constant to an object displaying a value C: 

( 



d<b(p) 




x 5 G 



( i 




x 3 H\ 


x 2 








1 




Y 3 




1 


{ 


Y 4 




1 


C- 


f Y 5 



* 



(30) 



where the *'s are certain functions; p stands, as usual, for (t, Xi, yi, . . . , x^, y^), and 
/, G, H are invertible function germs. (This time $ preserves the loci of materialization 
of the sandwich class 1.2.12, implying X 5 = Gx$ and X 3 = Hx 3 .) Two basic consequences 
of (30) are 

dY 4 , 



Cf\ 



dx A 



and 



fG 



dY 4 
8xa 



[GYa 
dx 4 



m 

dx 3 



(31) 
(32) 



the latter implied by a direct expression for the function fGY 4 that is encapsulated in 
(30). Thus the properties of Y 3 are getting important. In this respect, the (important) 
normalization to 0, in both germs conjugated by $, of the additive constant standing 
next to x 4 implies 

dY 3 dY 3 , 

7^ + 7^0=0 33 

ax 2 dx 3 

On the other hand, there simply holds 
Lemma 1. gf | o = . 

Proof. Expressing in (30) the function fGY 3 via Y 2 , one gets two informations. The first 
is 

dY 2 . 



while the second is 



fG 



dxo 



dx 2 

= fGY 3 

dx 2 



0, 



d 2 Y 2 
dxl 



(34) 
(35) 
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But (30) allows also to express the function fGHY 2 via Y±, and that expansion is clearly 
affine in x 2 . Hence - ^pr^ = identically. Evaluating this at 0, 

= Mm + fGH n u = fGH *g u (36) 

ox 2 0x2 0X2 ox 2 

by (34). Hence | = 0, whence | = by (35). Lemma is proved. 

In view of Lemma 1, p | = by (33), and so p | = by (32). Now C = by 
(31). 

On the other hand, it is elementary to normalize a non-zero value C in (29) to 1. 
Summarizing, in the •• case the germs are either equivalent to 

dxi — x 2 dt = dyi — y 2 dt = 

dt - x 3 dx 2 = dy 2 - y 3 dx 2 = 

dx 3 - (1 + x 4 )dx 2 = dy 3 - y 4 dx 2 = (37) 



or else to 



dx 2 — x 5 dx 4 = 


■ 


dy A - (1 + y 5 )dx 4 = 


dx\ — X2dt 


= 


dyi — y2dt = 


dt — x^dx2 


= 


dy 2 - yzdx 2 = 


dx 3 — (1 + x 4 )dx2 


= 


dy 3 - y±dx 2 = 


dx2 — x$dx 4 


= 


dy 4 - y^dx 4 = 



(38) 



6 The most involved class 1.2.1.1 

We strive, endly, to classify the class 1.2.1.1 and start from an obvious (and rough) 
pseudo-normal form subsuming this entire class, 

dxi — x 2 dt = dyi — y 2 dt = 

dt — x 3 dx 2 = dy 2 — y 3 dx 2 = 

dx 3 — {b + x^)dx2 = dy 3 — (c + y 4 )dx2 = (39) 
dx^ — (B + x 5 )dx 2 = — {C + y$)dx 2 = . 

The first question is that concerning the strong nilpotency, and for strong nilpotency the 
small growth vectors are important. After not so hard computations, 

Observation 4. The small growth vector at G M 11 of an object (39) is 

[3, 5, 7, 9, 10, 11] when (b, c) ^ (0, 0), 

[3, 5, 7, 9, 10 2 , 11] when (6, c) = (0, 0) and (B, C) ^ (0, 0), 

[3, 5, 7, 9, 10 3 , 11] when (b, c) = (B, C) = (0, 0). 
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Notation. The three disjoint parts of 1.2.1.1 emerging from this observation are denoted, 
respectively (for momentary need), by 10i, IO2, and IO3. 

Proposition 4. The part IO3 entirely consists of strongly nilpotent germs. The parts 10i 
and IO2 contain only not strongly nilpotent germs of 2-flags. 

The idea of proof is the same as in Chapters 2 and 4, and we skip here all details. Instead 
of IO3, one could write, then, 1.2.1.1 +s — the family of all strongly nilpotent distributions 
in the singularity class 1.2.1.1. 

On the other hand, considering the Lie squares of the germs in 1.2.1.1 (that, after 
factoring out by their Cauchy characteristics, sit in the class 1.2.1), one can, with some 
abuse of notation, partition 

1.2.1.1 = 1.2.1_ s , tra Ul.2.1_ s , tan Ul.2.1 +s . (40) 

10i IO2UIO3 

Transvecting the introduced two partitions, one gets a finer partition 

1.2.1.1 = 1.2.1_ S)tra Ul.2.1_ S)tan U 1.2.1 +B ni0 2 u i.2.i +s nio 3 . 
Thus (still abusing notation for brevity) there are already four disjoint invariant parts 

• 1.2.1_ B)tra = 1.2.1_ B)tra nl0i, 

• 1.2.1_ S;tan = 1.2.1_ S)tan n 10i, 

• i.2.i +s n io 2 = io 2 , 

• i.2.i +8 nio 3 = io 3 = 1.2.1. i +s . 

Are these just orbits of the local classification? It will eventually turn out that only the 
first and the last part on the list are. 

To see it, we start by partitioning the second item according to the position (at the ref- 
erence point) of the distribution D in question, with respect to the locus of the singularity 
1.2.1_ stan . We denote by 1.2.1_ stan .l_ stra the germs D that are relatively (i. e., within the 
locus of the sandwich geometry 1.2.1.1) transverse to this locus, and by 1.2.1_ S)tan .l-s,tan 
those that are tangent to it. 

We continue by similarly partitioning the third item, even though the process is now 
more delicate. Namely, this time one will check the position of D with respect to the locus 
of an aggregated singularity 

1.2.1_ S)tan Ul.2.1 +s d ^ f 1.2.1 tan 

(that, in each its materialization, is still smooth, not stratified, and in any EKR coordi- 
nates for D sitting in the third item, has local equations £3 = x^ = 0). We denote by 
1.2.1 +s .l_ s t ra the germs that are relatively transverse to the locus of 1.2.1 tan , whereas by 
1.2.1 +s .l_ S)tan all those that are tangent to that locus. 

With these (prompting by themselves) definitions taken into account, our list of in- 
variant parts of 1.2.1.1 lengthens to six items: 
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1.2.1. 



-s, tra ; 



• 1.2. 1_ s,tan-l— s, tra j 

• 1.2. 1_ s,tan-l— s, tan j 

• 1.2.1 +B .l_ Sj tra j 

• 1.2.1 +s .l_ stan , 

• 1.2.1. l +8 . 

Theorem 8. The singularity class 1.2.1.1 splits into six orbits of the local classification. 
These orbits are listed above this theorem. The codimensions and local models can be read 
off from the proofs. 

7 Proof of Theorem 8 

We will address separately every one part on the list; the proofs for the first and third 
part will be quite involved. 

7.1 The orbit 1.2.1 s tra of codimension one. 

The only generic orbit within 1.2.1.1 is the first item on the list, 1.2.1_ Sjtr a- (Reiterating, 
this symbol should be understood in the sense that checking the inclusion of a germ D in 
this part deals only with the 'shorter' object [D, D]/L([D, D]).) A proof that it is indeed 
an orbit is not short. 

As the reader already knows (Theorem 4), the germs of special 2-flags sitting in the 
discussed part can be brought to the following pseudo-normal form 

dx\ — x 2 dt = dyi — y 2 dt = 

dt — x 3 dx 2 = dy 2 — y 3 dx 2 = 

dx 3 — (1 + Xi)dx 2 = dy 3 — y^dx 2 = (41) 

dx4 — (B + x^)dx 2 = dyi — {C + y<o)dx 2 = , 

and the issue is to reduce to zero the constants B and C . This will be done simultaneously, 
if starting for clarity from B. To that end, we propose to consider an artificially chosen 
subsystem - the left tower in (41). That is, 

dX l - X 2 dT = 

dT - X 3 dX 2 = 
dX 3 - (1 + X 4 )dX 2 = (42) 
dX 4 -(B + X 5 )dX 2 = 
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(we write capital letters because are going to make a substitution in (42)). This is a 
Goursat system living in the space R 6 (T, X±, . . . , X5). Although it has no singularities, 
the question of possible elimination of B in it formally resembles the setting in the proof 
of Theorem 17 in [ChM]. Therefore, we just adapt (with a shift in indices) the formulas 
derived there on pages 147-8: 

• T = t, Xi = -ff 2 + a;i, X 2 = -ft + x 2 

# X * = J^, X A = -^y - 1, 

m Y — £5 I Bjl+Xj) 2 r> 
^5 — 7 n \T ~T~ 7 7, — -D • 

(l-fxs) (l-f»s) 

The quickest way to check these is to evaluate d(T,X±, . . . , Xs)(t, x\, . . . , on the 
vector field [x 3 , x 2 x 3 , 1, 1 + x 4 , x 5 , 0] T and to get 

(l - |x 3 ) [X 3 , X 2 X 3 , 1, 1 + X 4 , 5 + X 5 , 0] T + (*)d/cfe 5 

with a function (*) whose properties are irrelevant. Continuing the proof of Proposition, 
we need to find Y 1; . . . , Y 5 , Y,- depending ont,x i ,y 1 , . . . , Xj,yj (j = 1, . . . , 5) that together 
with the already proposed T, X ± , . . . , X 5 are the components of a local diffeomorphism 
$ = (T, Xi, Yi, ... , X 5 , Y 5 ) that should conjugate (41) to another object of the type (41) 
with the model values B = C = 0. 6 

Precisely we require that 

(|) d(T, Xl, Yi . . . , X 5 , Y 5 )(t, xi, yi,..., x 5 , y 5 ) taken on the vector field 

[x 3 , x 2 x 3 , y 2 x 3 , 1, j/3, l + x 4 , J/4, x 5 , 2/5, 0, 0] T 

be the multiplicative coefficient (l — -§a; 3 ) times the vector field 

[X 3 , X 2 X 3 , Y 2 X 3 , 1, Y 3 , I+X4, Y 4 , £ + X 5 , C + Y 5 , 0, 0] T 

modulo ((9/(9x5, d/dys). (The coefficient (l — -fx 3 ) is prompted by the computations in 
[ChM].) 

The main relation implied by the conjugacy (f) is 

d J± + d J± U =C. (43) 
dx 2 dx 3 

Under (f), Y 4 gets expressed by Y 3 , and, after a short calculus, (43) boils down to 

« + £4 +2 /£_ + £*| = o . (44) 



<9t ' <9x 2 2 dx 2 dx 3 dx 3 



6 Note that -X3 is, as it should be, a multiple of £3, meaning preservation, by the sought diffeo <f>, of 
the set {F(-) D D 2 (-)} that is {x 3 = 0} for both germs. 
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In turn, still under ($), Y 3 gets expressed by Y 2 , 



dY 2 dY 2 dY 2 dY 2 dY 2 / B \ 

X 3^T + X ^3T, 1" ^32/2^ = [1-—X3)Y 3 , (45) 

at ox i oyi dx 2 dy 2 V 3 / 

showing under way that 

^|o = (46) 
dx 2 

is a must in the problem. Under (f), also Y 2 gets expressed by Yi, 

^ ar, ^ 

^ + X2 ^ + 2/2 ^ = F2 ' (47) 

which in turn implies another necessary condition 

dY 1 , 



dt 



. (48) 



Our objective is to write (44) in a simpler way and so get some hints concerning terms that 
are important in the expansion of Y\. (The components T, Xi, Y\ are the most important 
in $, as they entirely determine $. We know T and X 1; while Y\ remains to be proposed.) 
Towards that aim, note that | o = ^ | o — t^t I o> by applying, consecutively, (45) 
and (47) . Consequently - the key moment - we stipulate that 

^l„ = 0. (49) 

This clearly implies 1 = 0. It also implies as if for free, 

dY 3 



dx 2 



o = (50) 



(because, under ($), ||| + | = 0). The reader may observe at this point that (49) 
and (|) together are rather powerful. 

Back in the main line of arguments, the LHS of (45) is an affine function in x 3 , hence its 
second derivative with respect to x 3 vanishes identically. On the RHS of (45), it implies 
that 

2B dY 3 d 2 Y 3 







3 dx 3 dx 3 





d 2 Y 3 


o — 


dx 3 


d z Y 2 




9X2 


o = 



It is also quick to infer from (45) that 4^ | o = ^nr | o = (Y 2 is affine in x 2 , compare 
(47) ). All in all, under (49), the relation (44) assumes the form 

m „ d 2 Y, 



3 +2^-^- lo = C. 



dt dx 2 dx 3 
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Expressing it in terms of Y 2 , the first summand on the LHS is, by (45), equal to j^ 2 | o ; 
while the second can be got via differentiating (45) sidewise with respect to x 2 and x 3 , 

d 2 Y 2 dY 2 , BdY 3 d 2 Y 3 . d 2 Y 3 



2 UI 2 . D UI 3 

+ o I — — TT o T 7^ ^ | — a | i 



dtdx 2 dx\ 3 dx 2 dx 2 dx 3 dx 2 dx 3 

with (50) accounting for the last equality. The basic relation (43) thus becomes 

.*n + gk\, = c . (5i) 



dtdx 2 dxi 

Endly, (47) directly implies that §g | = |o and | = -Q^ | o, reducing (51) 

to 

5 £i» = c - < 52 > 

provided that (f), (46), (48) and (49) simultaneously hold. 

The relation (52) is a mayor step in the proof, yet the formula ^tx\ alone would 
not do for the component Y lt for one strives to construct a local diffeomorphism around 
e R 11 . But it is safe to take Y ± — y x + f tx\ and, following (47), Y 2 = y 2 + §£i + ^tx 2 . 
The additional requirements (46), (48) and (49) clearly hold for these proposed functions, 
while the whole approach is so developed as to obey (f). For reader's convenience, here 
are the formulas for the two next Y components. Y 3 is computed according to (45), 

(, B V 1 ( C 2C 
Y 3 = 1 - — x 3 \y 3 + —t + —x 2 x 3 



3 ) v 5 5 

and I4 is - under (f ) - a precise product derived from Y 3 , 

Y A = \l-—x 3 \ I y A + — x 3 + — x 2 (l +x 4 ) 1 

B ( B \ ^ ( C 2C \ 

+ -n--x 3 J {l + x 4 ) (y 3 + -t + —x 2 x 3 ) . (53) 

As regards the last component F 5 , there is no need to compute it: in the output EKR, 
the additive constant standing next to Y 5 is that given by the basic relation (43). That 
is, C. 7 

The diffeomorphism $ is now produced, and B, C can indeed be reduced to zero. 

□ 



7 One also directly sees that the function (53) substituted on the LHS of (43) produces the value C. 
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7.2 The orbit 1.2.1 s tan .l s tr a of codimension two. 

Any D from this part can, by Theorem 4 and an elementary rescaling, be written down 
under the pseudo-normal form 

dxi — x 2 dt = dyi — y 2 dt = 

dt — x 3 dx 2 = dy 2 — y?,dx 2 = 

dx 3 — x 4 dx 2 = dy 3 — (1 + y 4 )dx 2 = 

dx 4 — (1 + x 5 )dx 2 = dy 4 — {C + y$)dx 2 = , 

with certain constant C. The aim is to eliminate this constant. One starts, no wonder, 
from y 4 = y A + Cx 4 and computes dy 4 — (C + y^)dx 2 = dy 4 + Cc?X4 — (C + y$)dx 2 = 
dy 4 + C(l + £5)<ir2 — (C + y<o)dx 2 = dy 4 — (1/5 — Cx$)dx 2 , because dx 4 = (1 + x^)dx 2 in 
this pseudo-normal form. This prompts 2/5 = 2/5 + Cx 5 . Then, working still within the 
right tower, dy 3 - (1 + y 4 )dx 2 = dy 3 - C x 4 dx 2 - (1 + y 4 )dx 2 = dy 3 - C dx 3 - (1 + y 4 )dx 2 , 
because x 4 dx 2 = dx 3 for this differential system. This prompts y 3 = y 3 + Cx 3 . 
Similarly, upon substituting this expression for y 3 in dy 2 — y 3 dx 2 , one is led to write 
2/2 = 1/2 + C% then to substitute it to dy\ — y 2 dt, and eventually to write y\ = y 1 + ^t 2 . 
In the variables t, x±, . . . , x 4 , y 1 , . . . , y 4 the constant C disappears. □ 

7.3 The orbit 1.2.1 s tan .l s ta n of codimension three. 

This time, an arbitrary D from the 'doubly tangent' family can be written under the form 

dx\ — x 2 dt = dyi — y 2 dt = 

dt — x 3 dx 2 = dy 2 — y 3 dx 2 = 

dx 3 — x 4 dx 2 = dy 3 — (1 + y 4 )dx 2 = 

dx 4 — x 5 dx 2 = dy 4 — {C + y$)dx 2 = , 

with, again, a constant C that should be got rid of. We will effectively construct, giving 
detailed motivations first, new coordinates eating this C up. So searched is a local pre- 
serving G R 11 diffeo $ = (T, Xi, Yi, . . . , X 5 , Y 5 ) sending the EKR with C = to the 
one with any fixed value of C . 

That is, we demand this time that 

(ft) d(T, Xi, Yi . . . , X 5 , Ys)(t, xi, 2/1, ... , £5, 2/5) taken on the vector field 

[x 3 , x 2 x 3 , y 2 x 3 , 1, 2/3, z 4 , 1 + 2/4, ^5, 2/5, 0, 0] T 

be a function coefficient / times the vector field 

[X 3 , X 2 X 3 , Y 2 X 3 , 1, Y 3 , X 4 , I + F4, X 5 , C + Y 5 , 0, 0] T 

modulo (d/dx$, d/dy^), with / | 7^ 0. Note that / is not precised yet (in contrast to the 
treatment of the generic case) and will get concretized only at the end. Let us stipulate 
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additionally that / | = 1. Then the basic relation reads 



8x 2 dy, 
while (ff) implies 

8Y 3 8Y 3 8Y 3 dY 3 

x 3 ^-+x 2 x 3 - \~V2X 3T . h -= h 

at ox i oy\ 0x2 

dY 3 , dY 3 , „ 8Y 3 



m + ^\o = C, (54) 



y*7T + ^TT 1 + (1 + 2/4)^ = /(1 + n) • (55) 
dy 2 0x3 dy 3 



This relation allows to reduce (54) to 



df df , <9F 3 , «9 2 F 3 , d 2 Y 3 , a 2 F 3 



+ 2^- + ^#lo = ^. (56) 



dx 2 dy 3 dy 2 dx% dx 2 dy 3 dy% 

But (ff) implies also 

dY 2 dY 2 dY 2 dY 2 8Y 2 

x z^T ^x 2 x 3 - Vy 2 x 3 - V- h?/ 3 ^— = jY 3 

at 8x1 8yi 8x 2 8y 2 

which helps to further reduce (56). Namely, after careful computations that we skip here, 
that relation boils down to 



8x% + ^dx 2 dy 2 (&c 2 + dy 3 ) + ^ 8y 2 + 2 dx 2 ^ '° ^ ' 



Naturally, the objective is to descend further to indices 1 - to have only functions X 1 , Y x 
in the conditions for a conjugacy. Note that (due to the inclusion holding true in the 
2nd sandwich for both germs) the component X 3 is divisibe by x 3 , X 3 = x 3 G for certain 
function G, G | 07^ 0. Now we stipulate anew that 

fG = 1 identically (58) 

(so that, with one previous assumption, G \ o = 1). This and (ff) yield a compact expres- 
sion for Y 2 in terms of Yi, 

dY 1 8Y 1 8Y 1 

-wt + x 2-r \-V2^r- = Y 2 . 

dt 8x1 8y 1 

With its use, (57) gets reduced to 

3f , df\f 1 + 2 3Jl) lo = c (59) 



dx 2 8y 3 J V dy 

which still leaves something to be desired. But also / is expressable, under (ff), by the 
function X 2 alone: 

8X 2 8X 2 8X 2 8X 2 8X 2 

x 3 ^—+x 2 x 3 - h 2/22:3^ ^^3^— = /■ 

8t 8x\ 8y\ 8x 2 8y 2 
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On top of this, all the time under (ft) and (58), 

dx 1 ox 1 ox 1 

-^rr + x 2^ ^Vi-^— = x 2- 

at ox i oyi 

These premises suffice to reduce (59) ultimately to 



° Xs f l + 2^)\ = C. (60) 



This is a tremendous prompt and we are now about to finish. 

Indeed, one can take, simply, T — t, X\ — xi — ^y±, Y\ = y±, thus securing (60). Let 
us write down the remaining components, just going backwards along the presented line 
of arguments. Immediately we get X2 = xi — ^y2, Y 2 = 2/2, and X2 determines / = 
1 — jU3, which in turn determines X 3 = x 3 (l — jy 3 ) 1 . In parallel, (ff) determines 
Yz = 1/3(1 - jy 3 ) 1 , as well as 

Xi = x±(l-—yzj + —x 3 (l + y 4 )\l - — y 3 J . 



Now (55) quickly generates the key component Y4, 

(_\ 
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Y 4 = (l + y 4 )(l-^ 



which clearly satisfies (54). The proof is finished; there is no need to compute explicitly 
X 5 , Y 5 . Only as a matter of record, we note that, not surprisingly within 1.2.1_ Sj 
X4 e (x 3 , X4) (which is visible in the formula above) and X5 € (x%, X4, X5). □ 

Remark 5. It is precisely in this part of the singularity class 1.2.1.1 where we have 
detected an unexpected loss of stability under passing from k = 2 to k = 3; see Section 7.7 
for the details and interplay with the theory of singularities of curves. (Note that another, 
perfectly natural loss of stability is caused by the appearance of the new singularity class 
1.2.3.4 for k > 3, cf. [M3].) 

7.4 The orbit 1.2.1 +s .l s tra °f codimension three. 

To justify its being an orbit, there suffices just a repetition of the argument from the proof 
in Section 7.2. Indeed, when dealing with the preliminary normal form 

dxi — X2dt = dyi — y^dt = 

dt — x 3 dx 2 = dy2 — yzdx 2 = 

dx 3 — X/±dx2 = dy 3 — y^dx 2 = 

dx A - (1 + x 5 )dx 2 = - (C + ys)dx 2 = , 

and trying to eliminate the constant C, one performs the same transformations and uses 
virtually the same bar variables y as for the part 1.2.1_ Sita n-l-s,tra • D 
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7.5 The orbit 1.2.1 +s .1 s tan of codimension four. 

In view of Proposition 4, it is immediate to see that all such distribution germs are 
equivalent to the EKR 

dxi — x 2 dt = dyi — y 2 dt = 

dt — x$dx 2 = dy 2 — y^dx 2 = 

dx 3 — x^dx 2 = dyz — y^dx 2 = 

dx^ — x 5 dx 2 = dy4 — (1 + y§)dx 2 = . 

□ 



7.6 The orbit 1.2. 1. l +s of codimension five. 

The only EKR that has remained unused until this moment, and that services all strongly 
nilpotent germs in 1.2.1.1 (Proposition 4 again) is 

dxi — x 2 dt = dyi — y 2 dt = 

dt — Xsdx 2 = dy 2 — y%dx 2 = 

dxs — x^dx 2 = dy 3 — y^dx 2 = 

dx4 — x$dx 2 = dyi — y$dx 2 = . 

(In other words, within class 1.2.1.1 there holds the converse of the last item of Theorem 
4 in [M2].) □ 



7.7 Loss of stability when the width grows. 

The general ideology underlying the work on singularities of multi-flags is as follows. For 
any fixed k and r, there exists a huge 'monster' manifold M of dimension (r + l)k + 1 
and a universal rank-(/c + 1) distribution V on M generating a special A;-flag which 
realizes all possible local geometries of special /c-flags of length r - see Remark 3 in 
[M2]. In that way the points of M correspond to 'all' germs of rank-(/c + 1) distributions 
generating such flags. In fact, the couple (M, T>) is the outcome of a series of r so-called 
generalized Cartan prolongations (or rank-1 prolongations in the language of [SY]) started 
from (R k+1 , TR k+1 ). In parallel, smooth curves in IR fe+1 can also be Cartan-prolonged; 
their r-th prolongations lie in M. 

We want to give an example of prolongation of curves for k = 2 and r = 4. It will be 
in close relation with the orbit 1.2.1_ Sitan .l_ Sitan discussed in Section 7.3. Let us take the 
curve 7(s) = (t, Xi, yi)(s) = (s 4 , s 5 , s 6 ) that is excerpted from the list [GHo] of simple 
space curves. We compute its first prolongation, 

dx\ 5 dyi 3 2 

X 2 = —r- = ~S , y 2 = —r- = -S , 

dt 4 ' yi dt 2 ' 
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then second prolongation 



dt 16 o dy 2 12 

dx 2 5 dx 2 5 

and then third 

_ dx 3 _ 192 2 _ 48 

dx2 25 ' 25 

These results show that the third prolongation of 7 hits at s = the point-germ, on the 
relevant three-step monster manifold, with the additive constant || standing next to 2/4, 
and that 2/4 is identically zero on the prolonged curve. (The use of EKR's in this discussion 
is equivalent to taking a good coordinate chart in a piece of the monster.) Consequently, 
2/5 = ^ = in the fourth prolongation, while x 5 — jj^ — ^ffs. Indeed then, the fourth 
prolongation of 7 hits a germ in the orbit in question. That is, the model EKR with 
constants 1 (next to y 4 ) and C = is being hit by the fourth prolongation of the curve 

When one enlarges the underlying space from three to four dimensions, the curve 
(s 4 , s 5 , s 6 ) gets suspended to j(s) = (s 4 , s 5 , s 6 , 0) and keeps being simple. Yet its orbit 
becomes adjacent to the orbit of a less singular, also simple curve 7(5) = (s 4 , s 5 , s 6 , s 7 ); 
compare in [Ar] the lists of sporadic simple curves in dimension 4. Hence one gets two 
closely related, if non-equivalent, curves 7 and 7. The fourth prolongation of 7 hits at 
s = the EKR (61), given below, with D = 0. Whereas the fourth prolongation of 7 hits 
the member of (61) with D = |||. 

dx 1 — x 2 dt = dyi — y 2 dt = dz\ — z 2 dt = 

dt — x$dx 2 = dy 2 — y%dx 2 = dz 2 — z^dx 2 = 

dx 3 — x A dx 2 = dy 3 — (1 + yi)dx 2 = dz 3 — z 4 rfx 2 = (61) 

dx4 — x 5 dx 2 = dy A — y$dx 2 = dz± — (D + z 5 )dx 2 = 

From this non-equivalence of 4-dimensional curve germs one cannot automatically deduce 
that the respective EKR objects (61) are non-equivalent. Yet, surprisingly in the optics 
of special 2-flags, the constant D ^ in the EKR family (61) cannot be reduced to 0, 
indeed. It either vanishes or can be normalized to 1. This means that a single orbit in 
width 2, in width 3 consists of two different orbits. In other words, it splits up into two 
orbits when the width grows from 2 to 3. Thus, in width three, the class 1.2.1.1 splits 
up into at least seven orbits of the local classification! Reiterating, a proof of this loss 
of stability phenomenon does not follow from the curves' classification in [Ar] . It exceeds 
the scope of the present work and will be produced in [MPe2]. 

Attempting right now at a (tentative) conclusion, non-equivalences in the world of 
curves may firmly suggest probable non-equivalences of germs - points of the monster 
that are hit by Cartan prolongations of curves. It was not so in the case of [MPe2]. Had 
we noticed, however, the pertinent sporadic curves in [Ar] earlier, we would have worked 
towards the non-equivalence of D = and D 7^ in (61) in a more deterministic context. 
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8 Appendix 



We want to show that for any two different values c and c the distributions (2) are non- 
equivalent. Suppose the existence of a diffeomorphism 

$ = (T, X u Y u X 2 , Y 2 ,..., X 8 , Y 8 ) : (R 17 , 0) 

conjugating these two objects. The aim is to show that c — c. Clearly, 

• T, Xi, Y\ depend only on t, xi, y\ , 

• for 2 < j < 8, functions Xj, Yj depend only on t, x 1 ,y 1 , x 2 , y 2 , . . . , Xj, yy 

In the discussed situation one knows more about the components X 3 , X 5 , and X 7 : 

• X 3 (t, x u y u x 2 , y 2 , x 3 , y 3 ) = x 3 K(t, x 1} y u x 2 , y 2 , x 3 , y 3 ) , 

• X 5 (t, X!,..., y 5 ) = x 5 H(t, xi,..., x 5 , y 5 ) , 

• X 7 (t, xi, . . . , y 7 ) = x 7 G(t, xi, . . . , x 7 , y 7 ) 

for certain invertible at functions G, H, K. Moreover, the preservation of the distribution 
(d/dxs, d/dy$) implies that there must exist an invertible at function /, / | ^ 0, such 
that 

/ / / 

x 5 H 




ftp) 



x 7 G 



(62) 



where p = (t, x±, y±, . . . , xg, ys) and, for bigger transparence, the arguments in the func- 
tions G, H, K, X 2 , . . . , Ys on the RHS are not written. This vector relation entails the 
set of 15 scalar equations on the consecutive components d/dt, d/dxi, . . . , d/dx 7 , d/dy 7 ; 
we disregard the two last components - the components in the directions of L(D 6 ) C D' . 
In view of the first 11 components of $ depending only, recalling, on t, Xi,..., 
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1/5, the upper 11 among these scalar equations can be divided sidewise by x-j. Likewise 
and additionally, the upper 7 among them can be divided by x$, and the first three - 
additionally by x 3 . Agree to call thus simplified equations 'level T", 'level Xi, 'level X 7 \ 
etc, in function of the row of d<&(p) being involved. For instance, the level T equation is 
the <9/<9t-component scalar equation in (62) divided sidewise by the product 0:3X5X7. 

Because |^ \ = G \ , it follows from the level X 7 that 



while from the level X 6 one gets 



cG |o = cf\ , (63) 



In turn, the level X 5 can be written in a short form 

BX 

(*)x 5 + ^(1 + x 6 ) + (*)y 6 = fG(l + X 6 ) , (65) 
ox 5 

and, additionally, the level X 4 is the defining equation for the factor fG on the RHS in 
(65). In particular that level shows that fG depends only on t, xi, . . . , y$. Hence fG, as 
well as X5, do not depend on xq, and, moreover, \ q = H \ q. Now it is very quick to 
differentiate (65) with respect to x 6 at 0: 

H\ = fG^\ . (66) 

One is already half way through because, upon evaluating (65) at 0, 

H\ = fG\ (67) 

and this quantity is clearly non-zero. So (67), (66), (64) together imply 

/ I = 1 - (68) 

At this point the reader may feel already that, with one more constant 1 standing next 
to X4, this line of arguments can be repeated, with / replaced by fG and X$ replaced by 
X 4 . It is indeed the case (and simultaneously a kind of explanation that, for this type 
of argumentation, needed is nothing shorter than the class 1.2.1.2.1.2.1). To conclude the 
justification of a modulus, we are going to just write a sequence of relations holding true, 
with only short indications of sources for them. 

fG I = -77— - I from the level X 4 , 

<7X 4 

dX 

(*)x 3 + 1 r^(l + x A ) + (*)y 4 = fGH(l + X 4 ) (the level X 3 ) , 
ox % 
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fGH depends only on t, x\, . . . , y 3 (the level X 2 ) and — — \ — K \ , 

ox 3 

dX 

K |o = fGH— — |o (differentiating the level X 3 w.r.t. xA , 

0x4 

K 1 = fGH I 7^ evaluating the level X 3 at . 

fG I = 1 following from all the above facts . 

This last relation together with (68) say that / | = G | = 1- Now (63) boils down to 
c = c . The invariant character of the parameter c in (2) is shown. 

Remark 6. Note that an analogous proof in the space of 1-flags would be false. For, in 
the Goursat case, there is no second sandwich, they only commence by No 3. So one could 
not claim (as is done above) that the function X 3 is divisible by x 3 . And, besides, it is 
well known that in length seven the local classification of Goursat is still discrete. 
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